A Methodology for Empirical Performance Evaluationof Page Segmentation AlgorithmsSong
نویسندگان
چکیده
Document page segmentation is a crucial preprocessing step in Optical Character Recognition (OCR) systems. While numerous page segmentation algorithms have been proposed , there is relatively less literature on comparative evaluation | empirical or theoretical | of these algorithms. For the existing performance evaluation methods, two crucial components are usually missing: 1) automatic training of algorithms with free parameters and 2) statistical and error analysis of experimental results. In this thesis, we use the following ve-step methodology to quantitatively compare the performance of page segmentation algorithms: 1) First we create mutually exclusive training and test datasets with groundtruth, 2) we then select a meaningful and computable performance metric, 3) an optimization procedure is then used to search automatically for the optimal parameter values of the segmentation algorithms, 4) the segmentation algorithms are then evaluated on the test dataset, and nally 5) a statistical error analysis is performed to give the statistical signiicance of the experimental results. The automatic training of algorithms is posed as an optimization problem and a direct search method | the simplex method | is used to search for a set of optimal parameter values. A paired-model statistical analysis and an error analysis are conducted to provide conndence intervals for the experimental results and to interpret the functionalities of algorithms. This methodology is applied to the evaluation of ve page segmentation algorithms, of which three are representative research algorithms and the other two are well-known commercial products, on 978 images from the University of Washington III dataset. It is found that the performances of the Voronoi, Docstrum and Caere segmentation algorithms are not signiicantly diierent from each other, but they are signiicantly better than that of ScanSoft's segmentation algorithm, which in turn is signiicantly better than that of X-Y cut.
منابع مشابه
Integrating AHP and data mining for effective retailer segmentation based on retailer lifetime value
Data mining techniques have been used widely in the area of customer relationship management (CRM). In this study, we have applied data mining techniques to address a problem in business-to-business (B2B) setting. In a manufacturer-retailer-consumer chain, a manufacturer should improve its relationship with retailers to continue its business. Segmentation is a useful tool for identifying groups...
متن کاملEmpirical performance evaluation of page segmentation algorithms
Document page segmentation is a crucial preprocessing step in Optical Character Recognition (OCR) system. While numerous segmentation algorithms have been proposed, there is relatively less literature on comparative evaluation | empirical or theoretical | of these algorithms. We use the following ve step methodology to quantitatively compare the performance of page segmentation algorithms: 1) F...
متن کاملPersian Printed Document Analysis and Page Segmentation
This paper presents, a hybrid method, low-resolution and high-resolution, for Persian page segmentation. In the low-resolution page segmentation, a pyramidal image structure is constructed for multiscale analysis and segments document image to a set of regions. By high-resolution page segmentation, by connected components analysis, each region is segmented to homogeneous regions and identifyi...
متن کاملEmpirical Performance Evaluation Methodology and Its Application to Page Segmentation Algorithms
ÐWhile numerous page segmentation algorithms have been proposed in the literature, there is lack of comparative evaluationÐempirical or theoreticalÐof these algorithms. In the existing performance evaluation methods, two crucial components are usually missing: 1) automatic training of algorithms with free parameters and 2) statistical and error analysis of experimental results. In this paper, w...
متن کاملA Time-Frequency approach for EEG signal segmentation
The record of human brain neural activities, namely electroencephalogram (EEG), is generally known as a non-stationary and nonlinear signal. In many applications, it is useful to divide the EEGs into segments within which the signals can be considered stationary. Combination of empirical mode decomposition (EMD) and Hilbert transform, called Hilbert-Huang transform (HHT), is a new and powerful ...
متن کامل